Every stochastic game with perfect information admits a canonical form

نویسندگان

  • Endre Boros
  • Vladimir Gurvich
  • Khaled Elbassioni
  • Kazuhisa Makino
چکیده

We consider discounted and undiscounted stochastic games with perfect information in the form of a natural BWR-model with positions of three types: VB Black, VW White, VR Random. These BWR-games lie in the complexity class NP∩CoNP and contain the well-known cyclic games (when VR is empty) and Markov decision processes (when VB or VW is empty). We show that the BWR-model is polynomial-time equivalent with the classical Gillette model, and, as follows from a recent result by Miltersen (2008), with simple stochastic games (so called Condon’s games), as well. Furthermore, we consider standard potential transformations rx(v, u) = r(v, u) + x(v) − βx(u) of the local reward function r, where β ∈ [0, 1) is the discount factor and β = 1 in the undiscounted case. As our main result, we show that every BWR-game can be reduced by such a transformation to a canonical form in which locally optimal strategies are globally optimal, and hence the value for every initial position and the optimal strategies of both players are obvious. Standardly, the optimal strategies are uniformly optimal (or ergodic, that is, do not depend on the initial position) and coincide with the optimal strategies of the original BWR-game; while the original values are transformed by a very simple formula: μx(v) = μ(v) + (1− β)x(v). In the discounted case, β < 1, the transformed values are also ergodic and the corresponding potentials can be found in polynomial time. Yet, this time tends to infinity, as β → 1−.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The Determinacy of Infinite Games with Eventual Perfect Monitoring

An infinite two-player zero-sum game with a Borel winning set, in which the opponent’s actions are monitored eventually but not necessarily immediately after they are played, admits a value. The proof relies on a representation of the game as a stochastic game with perfect information, in which Nature operates as a delegate for the players and performs the randomizations for them.

متن کامل

Perfect-Information Games with Lower-Semicontinuous Payoffs

We prove that every multi-player perfect-information game with bounded and lower-semi-continuous payoffs admits a subgame-perfect ε-equilibrium in pure strategies. This result complements Example 3 in Solan and Vieille (2003), which shows that a subgame-perfect ε-equilibrium in pure strategies need not exist when the payoffs are not lower-semi-continuous. In addition, if the range of payoffs is...

متن کامل

Perfect Correlated Equilibria in Stopping Games

We prove that every undiscounted multi-player stopping game in discrete time admits an approximate correlated equilibrium. Moreover, the equilibrium has three appealing properties: trembling-hand perfectness players do not use non-credible threats; normal-form correlation communication is required only before the game starts; uniformness it is an approximate equilibrium in any long enough niteh...

متن کامل

The (non-)existence of perfect codes in Lucas cubes

A Fibonacci string of length $n$ is a binary string $b = b_1b_2ldots b_n$ in which for every $1 leq i < n$, $b_icdot b_{i+1} = 0$. In other words, a Fibonacci string is a binary string without 11 as a substring. Similarly, a Lucas string is a Fibonacci string $b_1b_2ldots b_n$ that $b_1cdot b_n = 0$. For a natural number $ngeq1$, a Fibonacci cube of dimension $n$ is denoted by $Gamma_n$ and i...

متن کامل

On Canonical Forms for Two-person Zero-sum Limiting Average Payoff Stochastic Games

We consider two-person zero-sum mean payoff undiscounted stochastic games. We give a sufficient condition for the existence of a saddle point in uniformly optimal stationary strategies. Namely, we obtain sufficient conditions that enable us to bring the game, by applying potential transformations to a canonical form in which locally optimal strategies are globally optimal, and hence the value f...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009